This course balances foundational but relatively basic 
material in algorithms, statistics, graph theory, and related fields with real-world 
applications inspired by the current practice of 
internet and cloud services. Specifically, this 
course will look at social & information networks, 
recommender systems, clustering and community 
detection, search/retrieval/topic models, 
dimensionality reduction, stream computing, and 
online ad auctions. Together, these topics cover 
the main uses of data mining and analytics 
in social networking, e-commerce, social media, 
and similar services. The course combines 
theoretical material with weekly laboratory 
sessions in which students explore several 
large-scale real-world datasets. For the labs, 
students will work on dedicated infrastructure 
based on Hadoop and Apache Spark.
Outcome: Not Provided